Overloads #11

jchitel · 2018-03-01T06:15:19Z

This pull request adds function and generic type overloads to Ren.

To facilitate this unexpectedly complex feature, the entire compiler stack was rewritten according to pure functional principles. The lexer, parser, type checker, and translator are now all purely functional, making use of enforced immutability, avoiding classes, and taking advantage of TypeScript's discriminated union types instead of inheritance.

All of this made the compiler logic much easier to reason about and, ultimately, made overloads much easier to implement; they were difficult to support in the prior type system. The type system has also been completely rewritten to make more use of flags, enums, and discriminated unions.

TODO:

Rewrite lexer to be pure
Rewrite parser and syntax to be pure
Rewrite type checker and visitors to be pure and more flexible
Add overload support to type system (partially done)
Rewrite translator to be pure
Add tests for all of the above

This involves quite a few changes:

Parser:

The parser has been rewritten from the ground up. To satisfy the compiler's new restriction to pure functional programming, the parser logic is built on function composition. The result is a dead-simple API. There are 5 types of parse expressions: tok (tokens), seq (sequences), select (selections), optional, and repeat. Each one has a corresponding function that can be used to build a parse function capable of parsing any type of node.

The whole thing is now strongly typed. Sequences are strongly typed by way of overloads, and they are parsed as arrays instead of objects, with a transform function for converting the arrays to objects.

The concepts of 'soft' and 'definite' are now gone, because they were overkill. The parser will now greedily consume as much input as possible, and only throw an error when all options are exhausted. This means that order needs to be enforced even more strictly than before.

The parser also uses the new pure lexer, so the entire process is purely functional. Additionally, much of the complex logic around the various types of repetition and left recursion has been replaced with desugaring to simpler constructs, making the logic simpler as a whole. "Abstract" node types are explained in the Syntax Environment section.

Syntax:

The syntax no longer uses classes; it now uses interfaces with discriminated unions. The base type of all syntax node types is now NodeBase, which defines only a 'location' property. All sub-interfaces are required to specify a 'syntaxType' property set to a single specific SyntaxType enum value. This property is the discriminant, which we will make use of in the future. This whole structure is made possible by the extreme flexibility of the parser, which can now return any kind of value, not just class instances.
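As a rough illustration of the five parse-expression builders named above (tok, seq, select, optional, repeat) and of a transform function that turns a sequence's array result into a plain-object node, here is a minimal sketch. It is not Ren's actual implementation: the Token, Result, and Parser shapes, the transform signature, and the IntegerList node are all assumptions made for the example. It also simplifies in two ways: a failed parse returns null instead of throwing, and the discriminant is a string literal rather than a SyntaxType enum value.

```typescript
// Hypothetical shapes for illustration only; none of these are taken from Ren's source.
type Token = { kind: string; image: string };
type Result<T> = { value: T; rest: readonly Token[] } | null; // null means "no match"
type Parser<T> = (input: readonly Token[]) => Result<T>;

// tok: match a single token of the given kind.
const tok = (kind: string): Parser<Token> => input =>
    input.length > 0 && input[0].kind === kind
        ? { value: input[0], rest: input.slice(1) }
        : null;

// seq: run parsers in order; overloads give each arity a precisely typed tuple.
function seq<A, B>(a: Parser<A>, b: Parser<B>): Parser<[A, B]>;
function seq<A, B, C>(a: Parser<A>, b: Parser<B>, c: Parser<C>): Parser<[A, B, C]>;
function seq(...parsers: Parser<any>[]): Parser<any[]> {
    return input => {
        const values: any[] = [];
        let rest = input;
        for (const p of parsers) {
            const r = p(rest);
            if (r === null) return null;
            values.push(r.value);
            rest = r.rest;
        }
        return { value: values, rest };
    };
}

// select: try each option in order; the first match wins, so order matters.
const select = <T>(...options: Parser<T>[]): Parser<T> => input => {
    for (const p of options) {
        const r = p(input);
        if (r !== null) return r;
    }
    return null;
};

// optional: succeed with undefined (consuming nothing) when the inner parser fails.
const optional = <T>(p: Parser<T>): Parser<T | undefined> => input =>
    p(input) ?? { value: undefined, rest: input };

// repeat: greedily match zero or more times (the inner parser must consume input).
const repeat = <T>(p: Parser<T>): Parser<T[]> => input => {
    const values: T[] = [];
    let rest = input;
    for (let r = p(rest); r !== null; r = p(rest)) {
        values.push(r.value);
        rest = r.rest;
    }
    return { value: values, rest };
};

// transform: convert a parsed value (for example a seq tuple) into another shape.
const transform = <T, U>(p: Parser<T>, f: (value: T) => U): Parser<U> => input => {
    const r = p(input);
    return r === null ? null : { value: f(r.value), rest: r.rest };
};

// A hypothetical node: a plain object with a discriminant, not a class instance.
interface IntegerList { syntaxType: 'IntegerList'; values: number[] }

const item = transform(select(tok('INT'), tok('HEX')), t => Number(t.image));
const tail = repeat(transform(seq(tok('COMMA'), item), ([, value]) => value));
const integerList: Parser<IntegerList> = transform(
    seq(tok('LBRACK'), optional(seq(item, tail)), tok('RBRACK')),
    ([, body]): IntegerList => ({
        syntaxType: 'IntegerList',
        values: body === undefined ? [] : [body[0], ...body[1]],
    }),
);

const tokens: Token[] = [
    { kind: 'LBRACK', image: '[' }, { kind: 'INT', image: '1' },
    { kind: 'COMMA', image: ',' }, { kind: 'HEX', image: '0x10' },
    { kind: 'RBRACK', image: ']' },
];
const parsed = integerList(tokens);
```

Each builder returns a new function and never mutates its input, so parsers compose freely. The mutation inside seq and repeat is local to a single call and is not observable from outside.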
The high-level node types (declarations, types, expressions, and statements) are now just unions of their corresponding node types, and there is one high-level Node type that is the union of all of them. The Program type is now called ModuleRoot, and ModuleRoot and all import and export types are now separated from other declarations, because they are part of their own domain.

Syntax Environment:

The new parser API is specified in-place, not using functions. This means that node types that reference each other circularly will not work out of the box. We need mechanisms such as scope hoisting and referencing undeclared variables within functions to make it work properly. The only problem is that these mechanisms do not work cross-module.

To make this work, we have introduced the concept of a "syntax environment", which is a function that loads any circularly-referencing syntax types on demand. All of the high-level node types have their parser specification declared within the syntax environment inside functions. All types that depend on these do not declare their syntax at the module root; instead they declare it in "register()" functions that take their dependencies as parameters. The syntax environment's module imports all of these register functions and calls them within the environment function, where it has access to the high-level parse functions. This means that to have access to the parse functions of all syntax types, you need to call the SyntaxEnvironment function, which will return a fresh environment complete with circular references resolved.
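The register-function pattern described above might look roughly like the following sketch. Only the SyntaxEnvironment and "register" names come from the description; the Expression node shapes, the registerInteger and registerParenthesized functions, and the string-based token input are hypothetical simplifications, and the real environment presumably wires up far more node types.

```typescript
// Hypothetical, simplified shapes; tokens are plain strings here for brevity.
type Result<T> = { value: T; rest: readonly string[] } | null;
type Parser<T> = (input: readonly string[]) => Result<T>;

type Expression =
    | { syntaxType: 'Integer'; value: number }
    | { syntaxType: 'Parenthesized'; inner: Expression };

// Stand-in for a module whose node type has no circular dependency.
function registerInteger(): Parser<Expression> {
    return input =>
        input.length > 0 && /^\d+$/.test(input[0])
            ? { value: { syntaxType: 'Integer', value: Number(input[0]) }, rest: input.slice(1) }
            : null;
}

// Stand-in for a module whose node type depends on the high-level Expression
// parser. The dependency arrives as a parameter, not as a cross-module import.
function registerParenthesized(parseExpression: Parser<Expression>): Parser<Expression> {
    return input => {
        if (input[0] !== '(') return null;
        const inner = parseExpression(input.slice(1));
        if (inner === null || inner.rest[0] !== ')') return null;
        return {
            value: { syntaxType: 'Parenthesized', inner: inner.value },
            rest: inner.rest.slice(1),
        };
    };
}

// The environment function resolves the cycle: parseExpression is a hoisted
// function declaration, so it can be handed to register functions before the
// parsers it refers to exist. Its body only runs at parse time, by which point
// both constants below have been initialized.
function SyntaxEnvironment() {
    function parseExpression(input: readonly string[]): Result<Expression> {
        return parenthesized(input) ?? integer(input);
    }
    const integer = registerInteger();
    const parenthesized = registerParenthesized(parseExpression);
    return { parseExpression, integer, parenthesized };
}

const env = SyntaxEnvironment();
const nested = env.parseExpression(['(', '(', '7', ')', ')']);
```

Because every call to SyntaxEnvironment builds its own closures, each environment is independent of the others, which matches the "fresh environment" behavior described above.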


jchitel added 3 commits January 25, 2018 12:08

refactored tokenizer into 'lexer' which is now PURELY FUNCTIONAL

daf91b0

removing capital P parser to prevent weirdness

c475736

jchitel added the enhancement label Mar 1, 2018

jchitel self-assigned this Mar 1, 2018

jchitel and others added 12 commits April 29, 2018 07:54

added jest config, moving "old" code to src_old for increased clarity

b68bd97

converted everything back to classes, started working on new type checking process

3601fdf

added namespace declaration syntax

2894497

added high-level semantic types

88fe6fa

renamed "typecheck" to "semantic"

9ef3b64

added passes skeleton

4afbc6f

changing syntax and parser structure to better support visitors

d5b84e2

added first pass of type checker

50e70fc

started work on resolution pass

a862bba

refactored CoreObject to use set and mutate methods instead of clone

0bc3aa9

added local names to namespaces for declarations

a1bafa8

adding for posterity

e348d82
